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M th d and Apparatus For Efficient Implem ntation 
of Round Robin Control Unit 



Field of Invention 

[0001] The field of invention relates to electronic circuit design, generally; and, 
more specifically, to a method and apparatus for the efficient implementation of a 
round robin control unit 

Background 

[0002] Figure 1 shows n queues 1 01 0 through 1 01 n -i . If the output traffic from 
each of the n queues 101 0 through 101 n -i is designed to flow through a channel 
102 having a limited amount of bandwidth resources, in typical applications, the 
simultaneous emission of output traffic from each of the n queues 101 0 through 
101 n -i "overloads" the bandwidth resources of the channel 102. That is, more 
traffic is offered to the channel 102 than it can handle. In order to prevent the 
overload of the channel 102, the total output traffic from all of the queues 101 0 
through 101 no i is throttled back causing traffic to build up within the queues 101 0 
through 101 n -i- 

[0003] As such, the queues 1 01 0 through 1 01 n -i may be said to "contend" for 
the bandwidth resources of the channel 102 - in the sense that their traffic build- 
up causes them to desire the use of the channel 102. In order to implement the 
throttling back of total traffic from all of the queues 101 0 through 101 n -i, for 
example, one of the queues may be given permission (i.e., a "grant") to release a 
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flow of output traffic while some or all of the other queues are forced to hold 
theirs. 

[0004] By distributing these grants to the queues 1 01 0 through 101 n -i in a fair 
manner, each queue will eventually be given a grant to release its output traffic. 
As such, the total flow of traffic from all of the queues 101 0 through 101 n -i is 
throttled back (e.g., so as to be commensurate with the bandwidth of the 
channel 102) and each of the queues 101 0 through 1 01 n -i has an opportunity to 
transmit. This corresponds to a fair resolution of the contention for the resources 
of the channel 102. 

[0005] A technique referred to as "round robin" may be used to implement a 
fair resolution for the contention of resources. In a round robin approach, queues 
are issued grants in a cyclical fashion. That is, for example, if each of the n 
queues 101 0 through 101 n -i has traffic to release: queue 101 0 is given a first 
grant; queue 1 01 1 is given a second grant; . . . ; queue 101 n -i is given an n th 
grant; queue 101 0 is given an (n+1) th grant; queue 101 1 is given an (n+2) th grant; 
. . . ; queue 1 01 n -i is given a (2n) th grant; etc.,. 

[0006] If a queue is "empty" (i.e., does not have traffic to send) when its turn 
arises to receive a grant, the robin robin mechanism "skips over" the empty 
queue and issues the next grant to the next queue in the cycle having traffic to 
release. For example, if queue 10^ is empty after queue 101 0 is given a grant, 
queue 101 2 is given the "next" grant (after the grant given to queue 101 0 ) if queue 
101 2 has traffic to release. If queue 101 2 does not have traffic to release, queue 
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IOI3 (not shown in Figure 1) is given the "next" grant if it has traffic to release, 
etc.. 

[0007] Note that the round robin mechanism may be applied to resource 
contentions generally. That is, applications other than a group of queues 101 0 
through 1 01 n -i that collectively feed a common channel 102 (as seen in Figure 1) 
may implement a round robin scheme. For example, common applications of 
round robin schemes include (among others) the granting of switching or routing 
resources (e.g., to streams of traffic within a network or networking system), the 
granting of processing resources (e.g., to applications or clients that use a 
computing system), etc. Regardless as to the exact environment where a round 
robin scheme is implemented, the element(s) that contend with one another for a 
resource (e.g., queues, traffic streams, software applications, client devices) may 
be referred to in a general way as the "requesting agent(s)". 
[0008] A problem with round robin schemes, however, is their complexity as 
the number of contenders "n" (e.g., the n queues 101 0 through 101 n -i observed in 
Figure 1) increases. As n increases, the round robin controlling mechanism has 
to "keep track of" more contentions and grants which results in complex and 
inefficient round robin design implementations. 
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Figures 

[0009] The present invention is illustrated by way of example, and not 
limitation, in the Figures of the accompanying drawings in which: 
[0010] Figure 1 shows n queues. 

[001 1] Figure 2 shows an embodiment of a round robin control unit that 
manages n queues in a round robin fashion. 

[0012] Figure 3a shows an embodiment of the round robin control unit of 
Figure 2. 

[0013] Figure 3b shows another embodiment of the round robin control unit of 
Figure 2. 

[0014] Figure 4 shows an embodiment of the methodology that round robin 
control unit of Figure 3a executes according to. 

[0015] Figure 5a shows an embodiment of a kth grant generation unit. 
[0016] Figure 5b shows another embodiment of a kth grant generation unit. 
[0017] Figure 6 shows an embodiment of a plurality of grant generation units 
over a range of k values. 

[0018] Figure 7 shows an embodiment of a register transfer level (RTL) 
description for a round robin control unit. 

[0019] Figure 8 shows an embodiment of an RTL description for a grant 
generation unit. 

[0020] Figure 9 shows another embodiment of an RTL level description for a 
grant generation unit. 
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Detailed Description 

Discussion of Round Robin Circuit Embodiments 
[0021] Referring to Figure 1 note that, in a sense, a round robin scheme may 
be viewed as being organized according to a hierarchy of eligibility for the grants 
that are issued over the course of a cycle. That is, for example, initially the first 
queue 101 0 is first recognized as being eligible for a grant. The remaining 
queues 101 1 through 1 01 n -i are then recognized as being eligible for a grant in 
succession (i.e., in order) from the second queue 1 01 1 to the last queue 101 n -i- 
[0022] With regard to the initial recognition of the first queue 101 0 as being 
eligible for a grant, if the first queue 101 0 is non-empty the first queue 101 0 
receives a grant; and, the second queue 1 01 1 is next recognized as being eligible 
for a grant. Alternatively, if the first queue 101 0 is empty the first queue 101 0 
does not receive a grant; and, the second queue 101 1 is recognized as being 
eligible for a grant. 

[0023] Thus regardless if the first queue 101 0 is deemed empty or non empty, 

O 

H> once the appropriate disposition has been reached as to the grant eligibility of the 
first queue 101 0 (i.e., a decision has been made to issue a grant or not to issue a 
grant to the first queue 101 1), recognition of grant eligibility passes to the second 
queue 101 1 (which is viewed as the next queue in the hierarchy) and a second 
disposition is reached for the second queue 1 01 1 (i.e., a second decision is made 
to issue a grant or not to issue a grant to the second queue 101i). 
[0024] The process of passing grant eligibility to the next queue in the 
hierarchy (e.g., from left to right as seen in Figure 1), and reaching a disposition 
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in response, is continually repeated until a disposition is reached for the last 
queue 1 01 n -i in the heirarchy. Thus, in this case, the round robin cycle can be 
viewed as the series of processes that extend from: 1) the recognizing of the first 
queue 101 0 as being grant eligible; to 2) the reaching of a disposition on the 
grant eligibility of the last queue 101 n -i. Note that because each queue is 
recognized as being grant eligible during a round robin cycle, the manner in 
which grants are issued is fair. 

[0025] Figure 2 shows an embodiment of a round robin control unit 202 that 
controls the round robin release of traffic from n queues 201 0 through 201 n -i. 
The round robin control unit 202 embodiment of Figure 2 employs "one hot" 
encoding to issue individual grants to each of the n queues 201 0 through 201 n -i- 
That is, a collection of n output signals 204 from the round robin control unit 
(shown as Grant_0 through Grant_n-1 in Figure 2) are used to individually 
present a grant to each of the n queues 201 0 through 201 n -i. 
[0026] For example, if only queue 201 0 is to be issued a grant, the Grant_0 
line 204 0 is provided an active logic signal by the round robin control unit 202 
while the remaining GranM through Grant_n-1 lines 204, through 204 n -i are 
provided an inactive logic signal. For example, if positive logic is employed, 
Grant_0 line 204 0 is provided with a "1" while the remaining GranM through 
Grant__n-1 lines 20^ through 204^ are provided with a "0". Similarly, if negative 
logic is employed, GranM) line 204 0 is provided with a "0" while the remaining 
GranM through Grant_n-1 lines 204i through 204 n -i are provided with a "1". 
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[0027] If the logic values presented on the collection of round robin control unit 
output lines 204 are viewed together as a vector, a round robin scheme can be 
implemented by shifting an active bit within the vector against a field of inactive 
bits. That is, for example, one "run through" of a round robin cycle (where each 
of queues 201 0 through 201 n -i are issued a grant) can be implemented if a 
sequence of output vectors: 

00... 001 (only Grant_0 line 204 0 active) 

00... 010 (only GranM line 204i active) 

00. . . 1 00 (only Grant_2 line 204 2 active) 

01 ...000 (only Grant_n-2 line 204 n -2 active) 

10. ..000 (only Grant_n-1 line 204 n -i active) 

are provided by the round robin control unit 202 across the grant lines 204. 
[0028] Note that within the "one-hot" vector structure listed above, the vector 
bits are ordered in significance in a manner that corresponds with the queues 
201 o through 201 n -i themselves. That is, in the embodiment provided above, 
Grant_0 line 204 0 corresponds to the least significant bit and Grant_n-1 line 204 n - 
! corresponds to the most significant bit. Thus, the embodiment above 
corresponds to a round robin scheme implementation wherein a "next" grant 
within a round robin cycle is issued by shifting an active bit to a next higher 
vector column. 

[0029] The round robin control unit 202 of Figure 2 not only provides a 
"vector" as an output signal across grant lines 204 but also receives a "vector" as 




an input signal across request lines 203. That is, as seen in Figur 2, n request 
lines 203 (labeled as Req_0 203 0 through Req_n-1 203 n -i) are sent to the round 
robin control unit 202 from queues 201 0 through 201 n -i. In the embodiment of 
Figure 2, a queue that desires to release output traffic (e.g., such as any queue 
having en-queued information that is waiting to be released) activates its 
appropriate request line. 

[0030] That is, if queue 201 0 desires to release traffic the Req_0 line 203 0 is 
activated (e.g., is given an active logic signal); if queue 201 1 desires to release 
traffic the Req_1 line 203i is activated;...; and; if queue 201 n -i desires to release 
traffic the Req_n-1 line 203 n .i is activated. Thus, the request lines 203 may be 
collectively viewed as an input vector (to the round robin control unit 202) that 
continually changes as traffic is added to and removed from the queues. 
[0031] That is, when a queue is emptied (e.g., after it has been issued a grant) 
the request line signal for the queue is changed from active to inactive; and, after 
an emptied queue becomes non-empty (e.g., just after an emptied queue 
receives new information for enqueueing) the request line signal for the queue is 
changed from inactive to active. As each queue in the collection of queues 201 0 
through 201 n -i changes its state (from empty to non empty or non empty to 
empty) the input vector to the round robin control unit changes. Note that a 
queue's state corresponds to whether or not a queue is empty (e.g., the queue 
may be said to have an empty state or a non empty state). 
[0032] Note that a single input vector can have more than one active request 
line. For example, if queues 201 0 and 202! are simultaneosuly non-empty, an 
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active signal will be presented on request lines ReqJ) 203 0 and Req_1 203^ for 
as long as these queues 201 0 , 202i remain simultaneously non-empty. Note 
that, in a sense, multiple active requests within a single input vector may be 
viewed as a manifestation of the contention for resources. That is, continuing 
with the example provided just above, if the request lines Req_0 203 0 and Req_1 
203i are simultaneosuly activated, their respective queues 201 0 and 202i are 
effectively competing with one another for a grant from the round robin control 
unit 202. 

[0033] The round robin control unit 202 "decides", in an manner that is 
consistent with a round robin approach, which of the input vector's active 
request(s) is to be positively responded to by the issuance of a grant along its 
corresponding grant line (e.g., if a request on Req_0 line 203 0 is recognized as 
being an appropriate round robin selection, grant line Grant_0 204 0 is given an 
active signal). In various embodiments, the round robin control unit 202 may be 
designed to operate in a synchronous fashion (e.g., by issuing output vectors in 
pace with a clock signal). The input vectors may also be provided in a 
synchronous fashion as well. Asynchronous designs are also possible. 
[0034] Note also that, as a more expansive or general depiction, queues 201 0 
through 201 n -i may be replaced with (or viewed as) "request agents" as 
discussed in the background (rather than queues). Figure 3a shows an 
embodiment 302 of a round robin control unit that may be used for the round 
robin control unit 202 of Figure 2. In the depiction of Figure 3a, grant lines 304 
may be viewed as corresponding to grant lines 204 of Figure 2; and, request 
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lines 303 may be viewed as corresponding to request lines 203 of Figure 2. 
Note that the request lines 303 are drawn as a bus-like structure. 
[0035] As seen in Figure 3a, the collection of request lines 303 (i.e., Req_0 
through Req_n-1) are presented to each of n unique grant generation units 305 0 
through 305 n -i. Each grant generation unit has circuitry capable of generating an 
entire output vector for presentation across grant lines 304. The particular grant 
generation unit that is used to provide the round robin control unit 302 output 
vector 304 corresponds to the request agent being recognized as grant eligible. 
[0036] That is, if the first request agent is recognized as being grant eligible 
(e.g., queue 201 0 in Figure 2), the first (k=0) grant generation unit 305 0 provides 
the output vector 304 of the round robin control unit; if the second request agent 
is recognized as being grant eligible (e.g., queue 201 1 in Figure 2), the second 
(k=1) grant generation unit 305i provides the output vector 304 of the round robin 
control unit; ...; and if the last request agent is recognized as being grant eligible 
(e.g., queue 201 n -i in Figure 2), the last (k=n-1) grant generation unit 305 n -i 
provides the output vector 304 of the round robin control unit. 
[0037] As grant eligibility can be determined by which request agent last 
received a grant (e.g., if request agent k = 0 last received a grant, the next 
request agent to be grant eligible will be request agent k = 1), the particular 
embodiment of Figure 3a is designed to use a particular grant generation unit in 
response to which grant line was last activated. Figure 4 shows a methodology 
400 that illustrates this aspect in more detail. 
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[0038] Referring to Figur s 3a and 4, according to the methodology of Figure 
4: 

1) if the Grant_0 line 304 0 was activated within the previous output 
vector, the k=1 grant generation unit 305i is used to generate 402^ 
the next output vector; 

2) if the Grant_1 line 304! was activated within the previous output 
vector, the k=2 grant generation unit 305 2 is used to generate 402 2 
the next output vector; 

3) if the Grant_2 line 304 2 was activated within the previous output 
vector, the k=3 grant generation unit 305 3 is used to generate 402 3 
the next output vector; 

n-2) if the Grant_n-3 line (not shown in Figure 3a) was activated within 

the previous output vector, the k=n-2 grant generation unit 305 n -2 is 

used to generate 402 n -2 the next output vector; 
n-1) if the Grant_n-2 line 304 n -2 was activated within the previous output 

vector, the k=n-1 grant generation unit 305 n -i is used to generate 

402 n -i the next output vector; and 
n) if the Grant_n-1 line 304 n -i was activated within the previous output 

vector, the k=0 grant generation unit 305 0 is used to generate 402 0 

the next output vector. 



12 



[0039] As seen in Figure 3a, the output lines 304 0 through 304 n -i are fedback 
so that the appropriate grant generation unit to be used for the generation of the 
next output vector can be determined. The fedback output lines 304 0 through 
304 n .i may be delayed by delay unit 31 0 so that the rate at which output vectors 
are formulated is controlled. Alternatively or in combination, the rate at which 
output vectors are formulated may depend upon a clock that helps time their 
presentation across output 304. The exact manner as to how the fed back output 
lines 304 0 through 304 n -i are used to determine the appropriate grant generation 
unit may vary from embodiment to embodiment. 

[0040] In the embodiment of Figure 3a, a grant generation unit selector 307 is 
responsible for enabling an appropriate multiplexer 306 input channel in 
response to each output vector that is presented across the grant lines 304. That 
is, if the Grant_0 line 304 0 was activated within the previous output vector, the 
second multiplexer input channel 306i is enabled (so that the next output vector 
is provided by the k=1 grant generation unit 305i); if the GranM line 304! was 
activated within the previous output vector, the third multiplexer input channel 
306 2 is enabled (so that the next output vector is provided by the k=2 grant 
generation unit 305 2 ), etc. 

[0041] As such, the input to the grant generation unit selector 307 
corresponds to the previous output vector (as presented at round robin control 
unit 304) shifted upwards by one bit. This automatically provides a "one hot" 
indication of the appropriate grant generation unit to use for the next output word. 
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For example, if the previous output vector at output 304 corresponds to 00... 001 , 
the input vector presented to the grant generation unit 307 is 00... 010. 
[0042] In an embodiment, the grant generation unit selector 307 corresponds 
to a "one hot" to "binary" converter where the multiplexer 306 channel select 
input 31 1 accepts binary rather than a "one hot" input. Alternatively, the 
multiplexer 306 may be designed to accept a "one hot" formatted input (in which 
case the selector 307 can simply pass its input vector to the multiplexer 306). In 
other alternate embodiments, the function of the grant generation selection unit 
307 may be built into the grant generation units themselves (e.g., in a distributed 
fashion) as observed in Figure 3b. 

[0043] In the embodiment of Figure 3b, the input vector to the grant 
generation selection unit 307 (as drawn in Figure 3a) is distributed across the 
collection of grant generation units 31 5 0 through 315 n -i. In this embodiment, the 
output lines 304 0 through 304 n -i are routed in a feedback arrangement (i.e., 
"fedback") and used as individual enable lines. That is, the Grant_n-1 line 304 n -i 
is used as an enable line for the first grant generation unit 31 5 0 , the Grant_0 line 
304 0 is used as an enable line for the second grant generation unit 31 5i; the 
GranM line 304! is used as an enable line for the second grant generation unit 
31 5 2 , etc. 

[0044] In the embodiment of Figure 3b, each non enabled grant generation 
unit produces a null vector (00... 000) which presents no logical weight to the 
vector input OR gate 320 and is therefore ignored. As such, if the grant line 
presented to a particular grant generation unit is non active, the grant generation 
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unit will have no effect on the output 304. As a result, the grant generation unit 
whose "enabling" input grant line was last activated will provide the next output 
vector 304 of the round robin control unit 302. 

[0045] Figure 5a shows an embodiment 505a of a kth grant generation unit 
that may be used for any of the grant generation units 305 0 through 305 n -i of 
Figure 3a. Recall that each of the grant generation units 305o through 305 n -i of 
Figure 3a have circuitry that produces an output vector of size n. As such, the 
output 506 of the grant generation unit 505a of Figure 5a may be viewed as 
corresponding to any of the grant generation unit outputs 306 0 through 306 n -i of 
Figure 3a (depending on which grant generation unit of Figure 3a the grant 
generation unit 505a of Figure 5a corresponds to).. 

[0046] Referring to the round robin control unit 302 embodiment of Figure 3a, 
note that each of the grant generation units 305 0 through 305 n -i also receives an 
input vector 303 having size n. As such, the input 503 to the grant generation 
unit 505a embodiment of Figure 5a is represented as an input vector of size n. 
Recall that the input vector corresponds to a vector of requests. For example, as 
discussed with respect to Figure 2, the input vector 203 corresponds to a "one 
hot" vector of n requests where one request is reserved for each of queues 201 0 
through 201 n -i. 

[0047] More generally, referring to the embodiment of Figure 5a, the input 
vector 503 may be viewed as a group of n individual requests for some type of 
service or access to resources that are contended for by a group of request 
agents. For example, the requests may correspond to requests for bandwidth 
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resources; switching or routing resources (e.g., within a network or networking 
system); processing resources (e.g., within a computing network or computing 
system); or other types of services or resources not listed just above. 
[0048] In the approach of Figure 5a, the request input vector 503 is organized 
to reflect the grant eligibility hierarchy of the round robin cycle. That is, in the 
example of Figure 5a, the least significant bit of the vector (ReqJD 503o) is from 
the first request agent to be recognized as grant eligible during a round robin 
cycle (e.g., queue 201 0 of Figure 2); the next significant bit beyond the least 
significant bit of the vector (Req_1 503i) is from the second element to be 
recognized as being grant eligible during a round robin cycle (e.g., queue 201 1), 
... the most significant bit (Req_n-1 503 n -i) is from the last request agent to be 
recognized as being grant eligible during a round robin cycle. 
[0049] As a round robin cycle may be viewed as the continuous passing of 
grant eligibility to each request agent and the reaching of a disposition in 
response thereto, the grant generation unit 505a of Figure 5a may be viewed as 
having a feature that manifests which request agent is being initially recognized 
as grant eligible and an element that reaches a disposition. Accordingly, with 
respect to Figure 5a, the "rolled" [(n-k):k] vector represents the former and the 
least significant active bit extraction unit 510 represents the later. 
[0050] More specifically, the input vector 503 is "rolled" such that the request 
agent being recognized as grant eligible is positioned as the least significant bit 
with respect to the input vector that is presented to the least significant active bit 
extraction unit 510. Recalling from the discussion of Figures 3a and 3b that the 
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use of particular grant generation unit is triggered by its corresponding request 
agent (k) being recognized as grant eligible because the previous request agent 
in the round robin cycle (k-1) was just issued a grant, note that each grant 
generation unit 305 0 through 305 n -i will have a different "rolled" vector topology in 
order to properly reflect the particular request agent that is being recognized as 
grant eligible. 

[0051] Figure 6 explores this in more detail. Figure 6 shows an embodiment 
of the "rolled" input vector arrangements that may be used for each of the grant 
generation units 305 0 through 305 n -i of Figure 3. That is, grant generation units 
605 0 through 605 n -i of Figure 6 correspond to, respectively, grant generation 
units 305 0 through 305 n -i of Figure 3a. As such, note that the "k" term in each 
grant generation unit corresponds to the request agent that is recognized as 
being grant eligible; and that the input vector 503 is rolled such that first k bits are 
placed in the most significant vector locations. 

[0052] The rolling of the input vector causes the least significant bit presented 
to the least significant active bit extraction unit 51 0 as the input vector bit that 
was generated from the request agent being recognized as grant eligible. For 
example referring to Figure 6, with respect to the first grant generation unit 605 0 , 
the first request agent (k=0) is recognized as grant eligible. As such the input 
vector 603 is not rolled and the request value (whether active or not active) 
issued by the first request agent is the least significant bit that is presented to the 
least active significant bit extraction unit 61 0 0 . 
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[0053] Similarly, with respect to the second grant generation unit 605i, the 
second request agent (k=1) is recognized as grant eligible. As such the input 
vector 603 is rolled by one bit and the request value issued by the second 
request agent is the least significant bit that is presented to the least active 
significant bit extraction unit 61 0 0 . Note that the request value issued by the first 
request agent becomes the most significant bit within the second grant 
generation unit 605i of Figure 6. 

[0054] The rolling of the input vector 603 continues in likewise fashion until, 
with respect to the last grant generation unit 605 n -i, the last request agent (k=n-1) 
is recognized as grant eligible. As such the input vector 603 is rolled by k-1 bits 
and the request value issued by the last request agent is the least significant bit 
that is presented to the least active significant bit extraction unit 61 0 0 . Note that 
the request values issued by the first through next-to-last request agents (k=0 
through k=n-2) become the most significant bits within the second grant 
generation unit 605i of Figure 6. 

[0055] Each of the least significant bit extraction units 61 0 0 through 61 0 n -i are 
designed to extract the least significant active bit of the rolled input vector they 
are presented with. That is, as a few examples, if a positive logic rolled input 
vector corresponds to 1 1 ... 1 1 1 , a least significant bit extraction unit will emit an 
output vector of 00... 001 ; or, if a positive logic rolled input vector corresponds to 
1 1 ... 1 1 0, a least significant bit extraction unit will emit an output vector of 
00... 010; or, if a rolled input vector corresponds to 10... 000, the least significant 
bit extraction unit will emit an output vector of 10... 000. 
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[0056] That is, as significance increases from right to left in the examples 
above, the least active significant bit extraction process may be viewed as 
effectively scanning the rolled input vector from the rightmost bit toward the left 
until an active bit is found. This bit corresponds to the least active bit of the rolled 
input vector. Once the least significant active bit is found, an output vector is 
created that presents the least active significant bit in an isolated fashion (thus 



ignoring active bits beyond the least significant active bit). Note that in actual 
implementations, the least significant bit extraction circuitry does not need to 
actually "scan" the rolled input vector. For example, basic combinatorial logic 
may be designed that simply passes forward the least significant bit. 
[0057] The least significant bit extraction process effectively makes one or 
more grant dispositions over a region of the round robin cycle hierarchy. The 
region over which the dispositions originate starts with the request agent that is 
recognized as being grant eligible (i.e., the kth request agent) and ends with the 
first request agent at or beyond the kth request agent who is actively asserting a 
request. For example, if n = 9 and the input vector 603 that is presented to the 
second (k=1) grant generation unit 605i is "100000101", the rolled input vector 
that is presented to the least significant active bit extraction unit 61 0i will be 
"110000010". 

[0058] That is, in accordance with the manner in which the input vector 603 is 
rolled within the second grant generation unit 605i of Figure 6, the first bit of 
input vector 603 is rolled from the least significant bit position (0) to the most 
significant bit position (n-1). In response to being presented with a "1 10000010" 
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vector, the least significant active bit extraction unit 61 Oi will provide at its output 
a "000000010" vector. Noting that this vector (i.e., that was just produced by the 
least significant active bit extraction unit 61 0i) is rolled back to the original format 
of input vector 603 (e.g., is "unrolled" so that the round robin cycle hierarchy is 
preserved), the output vector 6O61 of the second grant generation unit 605i will 
be "000000100". 

[0059] A few comments are in order from this example. Firstly, the rolling 
activity performed upon the input vector 603 effectively: 1) identifies which of the 
request agents is to be recognized as grant eligible within the framework of the 
round robin hierarchy; and, 2) builds the fairness of the round robin approach. 
Note that the status of the input vector 603 ("100000101") indicates that the first 
(k=0), third (k=2) and ninth (k=8) request agents are contending for resources. 
As the second grant generation unit 605i is designed to operate after the first 
(k=0) request agent has been issued a grant, the positioning of the request from 
the second (k=1) request agent as the least significant bit (via the input vector 
603 rolling process) effectively identifies the second request agent as being grant 
eligible. 

[0060] Furthermore, note that the rolling of the request from the first request 
agent to the most significant bit effectively makes the first request agent "last in 
line" to receive the next grant. This corresponds to the fairness aspect of the 
round robin technique. That is, as the first resource agent has just received a 
grant, the remaining resource agents should be recognized as being grant 
eligible before the first resource agent is again recognized. 




[0061] As a second observation, note that the least significant active bit 
extraction unit 61 d effectively makes a pair of dispositions for the second and 
third request agents. That is, after being presented with a vector of "1 10000010", 
the least significant active bit extraction unit 61 0i effectively scans the vector 
from right to left in search of the first active bit that is presented. As the least 
significant bit corresponds to the request value issued by the second (k=1) 
request agent, the second request agent is, effectively, initially recognized as 
being grant eligible. 

[0062] In this case, however, the least significant bit is inactive. That is, the 
second request agent is not requesting service. As such, the next request agent 
(i.e., the third (k=3) request agent) is effectively recognized as being grant 
eligible by scanning to the next significant bit. Here, the bit is found to be active. 
As such, the third (k=3) request agent is actively requesting service. Therefore, 
with the least significant active bit having been found, the least significant active 
bit extraction unit 61 0i presents a vector of "000000010". Thus, the least 
significant active bit extraction unit 61 0i has effectively made a disposition for the 
second request agent (which resulted in a disposition that a grant should not be 
issued) and the third request agent (which resulted in a disposition that a grant 
should be issued). 

[0063] The "00000001 0" vector produced by the least significant active bit 
extraction unit 61 d is then "unrolled" to form a "000000100" output vector at the 
output 6O61 of the second grant generation unit 605i. This vector is then 
presented at the output 304 of the round robin control unit 302 (referring briefly 
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back to Figure 3a) which signifies that the third (k=2) request agent is being 
issued a grant. As such, the fourth (k=3) grant generation unit 305 3 will provide 
the next output vector from the round robin control unit 302. Note that, consistent 
with the round robin hierarchy, the fourth (k=3) grant generation unit 3053 (as a 
result of its input vector rolling process) will initially identify the fifth (k=4) request 
agent as being grant eligible. 

[0064] Figure 5b shows an embodiment of a grant generation unit 51 5b that 
may be used for each of the grant generation units 51 5 0 through 515 n -i of Figure 
3b. Note that the enabling input 504 k . 1 of Figure 5b controls the output of the 
grant generation unit 515b. For example, as seen in Figure 5b, the enabling 
input 504^ is individually ANDed with each bit from the "un-rolled" least 
significant bit extraction unit 510 output vector. Thus, if the enabling input 504k.! 
is inactive (e.g., a "0"), the grant generation unit 515b produces a "null" vector 
(e.g., 00... 000) where each bit in the output 506 is inactive. If the enabling input 
504 k -i is active (e.g., a "1"), the grant generation unit 515b produces the "un- 
rolled" least significant bit extraction unit 510 output vector. 
[0065] For simplicity, this operation has been drawn as a single AND gate 520 
but, in actuality may be implemented as n AND gates having a first input tied to 
an un-rolled vector bit and a second input tied to the enabling input 504 k -i. Lastly 
note that, consistent with the embodiment of Figure 3b, the enabling input 504k.! 
may correspond to 504n-1 when k = 0. 
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Discussion of Round Robin Circuit Design Implementation 
Embodim nts 

[0066] Behavioral and/or RTL descriptions (such as Verilog descriptions and 
VHDL descriptions) are commonly used to describe a digital circuit design. 
Typically, the behavioral level description of a circuit corresponds to a functional 
description of the operation of the circuit. The behavioral level description 
commonly resembles a software program because both (i.e., a behavioral level 
description and a software program) define a methodology or sequence of 
operations. 

[0067] An RTL description is somewhat similar to a behavioral description (in 
that it resembles a software program) but also tends to include various hardware 
restrictions (such as the naming of nets on a bus, etc.). An RTL description is 
typically synthesized into a gate level netlist which describes the manner in which 
logic circuit elements (e.g., gates, registers, etc.) are to be interconnected 
together to form a semiconductor circuit that performs the methodology outlined 
in the behavioral level description. Other types of descriptions that have various 
characteristics of both behavioral and RTL level descriptions are also possible. 
[0068] The following discussion describes various approaches as to how the 
round robin approaches discussed above can be implemented at an RTL level so 
that efficiencies are realized when gate level netlist is produced. Figure 7 shows 
an embodiment 700 of an RTL level description that describes the operation of a 
round robin control unit that conforms to the embodiment 302b of Figure 3b. In 
the embodiment of Figure 7, an n=20 design is described. Each of the unique 
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"prio" calls within description space 702 corresponds to a grant generation unit. 
As 20 prior calls are implemented within the embodiment of Figure 7, 20 grant 
generation units may be created when the behavioral level description is 
compiled into a netlist (or manufactured into a semiconductor chip). 
[0069] The "prio" subfunction that is being called in each of the prio calls 702 
observed in Figure 7 corresponds to an "extract least significant bit" function. As 
such, the input definitions to each of the prio calls 702 observed in Figure 7 
define the rolling of the input vector; and, the output definitions from each of the 
prio calls 702 observed in Figure 7 define the "un-rolling" of the vector that is 
produced by the least significant bit extraction process. Note also that each prior 
call includes an enabling input statement ".en(..) n that corresponds to the 
enabling input 504 k .i of Figure 5b. Also, note that statement 703 corresponds to 
a large OR function which corresponds to the OR gate 320 observed in Figure 
3b. 

[0070] Figure 8 shows an embodiment of the "prio" subfunction referred to 
just above. As mentioned just above, the prio subfunction embodiment of Figure 
8 describes an "extract least significant bit" process. The methodology described 
in Figure 8 effectively scans the input vector in a direction from least significance 
to most significance and defines the appropriate output value to be generated in 
response for each bit location in the vector as it is scanned. The first active bit 
discovered produces the corresponding output value. 

[0071] Figure 9 shows an embodiment that divides the scanning process into 
four parallel operations. That is, a first 5 bit section of the rolled input vector 
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(4:0), a second 5 bit section of the rolled output vector (9:5), a third 5 bit section 
of the rolled input vector (14:10); and a fourth 5 bit section of the rolled output 
vector (19:15) are scanned in parallel. Each of these parallel operations are 
described in description spaces 901 through 904 respectively. Note that the 
least significant bit in each region (if any) is identified by each scanning process. 
Statement 905 calculates the proper output from the results of the parallel 
scanning operations. 

[0072] Note that the use of commercially available synthesizers (e.g., 
Synopsis) have algorithms for reducing and simplifying logic implementations. 
Approaches similar to that of Figure 8 (where the output is predefined from a 
serial scanning operation) have been found to consume logic gates when 
implemented; while approaches similar to Figure 9 (where the output is 
calculated from a series of parallel scanning operations) have been found to 
exhibit less propagation delay. Note also that the synthesis process may create 
circuit implementations that vary from either of the approaches shown in Figures 
3a or 3b. 

Additional Comments 
[0073] It is also important to point out that the "one hot" encoding features of 
the above described round robin circuit embodiments and circuit design 
implementation embodiments may be converted to binary encoding at various 
locations where possible. For example, referring to Figure 2, the round robin 
control unit 202 output 204 and/or input 202 may be binary encoded. Thus, the 




present discussion embraces embodimets other than those that are only "one 
hot" encoded. 

[0074] Furthermore, note that a family of alternative embodiments can be 
created through the use of a "most significant bit extraction process" and an input 
vector rolling approach that rolls vectors in an opposite direction from that 
observed in Figures 5a and 5b. For example, a "one hot" vector can be 
scanned in a most significant to least significant direction (e.g., from left to right 
for a given input vector). As such, the most significant kth bits would be rolled to 
the least significant region of the vector (rather than having the least significant 
kth bits rolled toward the most significant region of the vector as described with 
respect to Figures 5a, 5b and 6). Embodiments that position the bit from the 
request agent (who is to be recognized as being grant eligible) somewhere 
between the most significant and least significant bits of the rolled input vector 
are also possible. 

[0075] Note also that embodiments of the present description may be 
implemented not only within a semiconductor chip but also within machine 
readable media. For example, the designs discussed above may be stored upon 
and/or embedded within machine readable media associated with a design tool 
used for designing semiconductor devices. Examples include a netlist formatted 
in the VHSIC Hardware Description Language (VHDL) language, Verilog 
language or SPICE language. Some netlist examples include: a behaviorial level 
netlist, a register transfer level (RTL) netlist, a gate level netlist and a transistor 
level netlist. Machine readable media also include media having layout 
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information such as a GDS-II file. Furthermore, netlist files or other machine 
readable media for semiconductor chip design may be used in a simulation 
environment to perform the methods of the teachings described above. 
[0076] Thus, it is also to be understood that embodiments of this invention 
may be used as or to support a software program executed upon some form of 
processing core (such as the CPU of a computer) or otherwise implemented or 
realized upon or within a machine readable medium. A machine readable 
medium includes any mechanism for storing or transmitting information in a form 
readable by a machine (e.g., a computer). For example, a machine readable 
medium includes read only memory (ROM); random access memory (RAM); 
magnetic disk storage media; optical storage media; flash memory devices; 
electrical, optical, acoustical or other form of propagated signals (e.g., carrier 
waves, infrared signals, digital signals, etc.); etc. 

[0077] In the foregoing specification, the invention has been described with 
reference to specific exemplary embodiments thereof. It will, however, be 
evident that various modifications and changes may be made thereto without 
departing from the broader spirit and scope of the invention as set forth in the 
appended claims. The specification and drawings are, accordingly, to be 
regarded in an illustrative rather than a restrictive sense. 
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